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(54) Title: METHODS OF SELECTING A RANDOM PEPTIDE THAT BINDS TO A TARGET PROTEIN 
(57) Abstract 

A method of isolating a random peptide -scaffold peptide fusion protein that binds a predetermined target protein is disclosed. First, 
a library of nucleotide sequences encoding random amino acid sequences are generated and cloned into a first expression vector to make a 
library test vector. The nucleotide sequence is expressed within a yeast cell as a test fusion protein containing a transcriptional activation 
domain, a random peptide domain, and a scaffold peptide domain. The DNA sequence encoding a target protein cloned into a second 
expression vector provides a target vector. The expressed target fusion protein contains a DNA -binding domain and a target protein . Test 
fusion protein -target fusion protein interaction is detected according to a two-hybrid protein-protein interaction assay. Also disclosed are 
an HIV- 1 Rev protein binding peptide isolated by the method of the invention and methods of using the Rev interacting protein. 
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METHODS OF SELECTING A RANDOM PEPTIDF 
THAT BTNnc: T n fl Tfl^ GET PRfi'] '|^r 

Statement- ns t g FM^miv snnn.n^ p ~— ^rn 

This invention was made at least in part with 
funds from the Federal government, and the government 
therefore has rights in the invention. 

Bacftcnround of Trr ,- nMnn 
The invention relates to a method of selecting a 
fusion protein which binds to a target protein of 
interest within a host cell. The invention also relates 

tll s po ( u7v7TT that binds to the human 

virus (HIV, -l Rev protein, and methods of inhibiting HIV- 
1 production using the polypeptide. 

b a! H , Pr ° tein - pr0tein interactions within a cell are the 
basis for countless physiological control mechanism.. 
Viral control mechanisms are the targets of searches for 
the cellular host proteins with which a viral protein 
interacts as part of its function in the cell 

The -two-hybrid" assay, is a genetic screen 

TiZTlll described by Fields et al - (Fieids s - - -i. 

1989) Nature, 3*0:245-246; Fields, s. and Sternglanz R 
(«.4, r,ends in Cenetics, to test ^ 0 Z*- 

target protein interactions. The "two -hybrid" selection 
system utilizes the interaction between two 
interest to reconstitute an active transcription factor 
from two fus.on proteins; one containing a DNA binding 
domain fused to the protein of interest, and the other 
30 ZTIT 9 3 tranSCripti °" al -tivation "gion fused to a 
ITJ ^ (Chien C ' ^ Ct al - (1991 > OS*. 

Test protexn-target protein interaction is indicated by 
expression of a marker gene operably linXed to a promoter 
activated by the reconstituted transcriptional factor 
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Though useful, the two hybrid system cannot assay 
all protein-protein interactions. According to Fields, 
S. and Sternglanz, R. (1994), supra . proteins that cannot 
fold correctly within the cell are not suitable for 
5 selection by this system. 

Viruses are obligate intracellular parasites and 
depend on protein-protein interactions between viral and 
host cell proteins to carry out functions vital to their 
life cycle. Blocking or inhibiting these critical 
10 interactions is a therapeutic strategy that may have 

profound effects on the viral pathogen and little effect 
on the host cell. Furthermore, as mutation of the viral 
protein may be constrained by the requirement for 
functional interaction with host factors, such a strategy 
15 may be resistent to viral escape mutations. 

Current strategies to inhibit replication of HIV 
are commonly aimed at the virus' reverse transcriptase 
using nucleoside analogues like AZT, ddC and ddl . 
However the complex life cycle of this retrovirus offers 
20 unique targets of intervention in the form of small 
regulatory proteins that are now being intensively 
studied. One of these proteins, Rev, is a regulator of 
virion protein expression. 

Rev was first identified in proviral mutants that 
25 were unable to replicate because they did not produce 
detectable levels of Env and Gag proteins 
(Sodroski et al. (1986) Nature, 221:412-417; 
Feinberg et al. (1986) Cell, 1^:807-817). It is now 
clear that Rev induces the synthesis of these structural 
30 proteins by allowing the appearance of their mRNAs in the 
cytoplasm (Malim, M. et al. (1989) Nature, 254-257 ; 

Emerman, M. et al. (1989) Cell, 52:1155-1165). Since the 
mRHAs coding for Gag, Env, and Pol contain introns with 
sequences to which the Rev protein specifically binds, 
35 these mRNAs are normally translocated to the cytoplasm 
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for translation to their respective proteins during HIV 
infection • 

In the absence of Rev protein function, the mRNAs 
encoding Gag, Env, and Pol remain in the nucleus, are not 
5 expressed, and HIV-l replication is inhibited. As no 
cellular protein is known to perform a similar function, 
and because the concentration of the Rev protein must 
exceed a threshold for function (Pomerantz, R.J. et al. 

(1992) J . Virology, ££: 1809-1813) , Rev is considered a 
io prime target for specific antiviral agents (Cohen, J. 

(1993) Science, 26£:1257; Rosen, C.A. (1992) AIDS 
Research and Human Retroviruses, £: 175-181; Duan, L. et 
al. (1994) P.N.A.S. USA, 21: 5075-5079 ) . 

Rev is a small 116 amino acid phosphoprotein that 
is is located predominantly in the nucleus/nucleolus of 
human cells infected with HIV-i (Cullen, B. R. et al. 

(1988) J. Virol., £2:2498-2501; Felber, B. K. et al. 

(1989) P.N.A.S. USA, ££: 1495-1499) . Extensive 
mutagenesis has revealed that Rev contains at least two 

20 domains that are necessary for function: 1) a basic RNA 
binding domain at the N-terminus that binds to a specific 
RNA sequence (RRE, Rev Response Element) present in all 
unspliced HIV mRNAs, and 2) a C-terminal effector domain 
(Malim, M. et al. (1989) Cell, 5fi: 205-214; Olsen, 

25 H.S. et al. (1990) Genes and Dev., 1:1357-1364) necessary 
for function. The effector domain was defined by 
mutations, e.g., M10, that not only abolish Rev function 
but at the same time create transdominant negative 
mutants, i.e., mutants able to repress wild type Rev 

30 function when added in a molar excess (Malim, M. et al. 
(1989) , supra) . The M10 mutation does not affect the 
normal multimerization of Rev nor the ability of Rev to 
bind RNA, suggesting that the region of Rev containing 
the M10 mutation interacts with one or more cellular 

35 proteins (Malim, M. et al. (1991) J\ Virol., $5:4248- 
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4254) and that the interaction with a putative cellular 
target is essential for Rev function. 

fi^ppiarv of the Invention 
In general, the invention features a method of 
5 selecting a random amino acid sequence that binds to a 
target protein in a host cell, e.g., a yeast cell. The 
invention is based on the unexpected discovery that a 
stably folded polypeptide domain (a "scaffold") fused to 
the random amino acid sequence enhances such selection. 

10 In addition, the invention features a random peptide 
detected by the method of the invention, which is not 
naturally expressed in the cell from which its transcript 
was isolated, and which unexpectedly exhibited specific 
binding to the Rev protein effector domain. 

15 it is an object of the invention to enhance the 

ability to isolate random peptides which bind to a target 
protein of interest by expressing the random peptide 
within a known stable 3-dimensional structure, or 
scaffold, as a fusion protein. This fusion protein is 

20 expressed from a nucleic acid sequence encoding a random 
sequence of amino acids covalently linked to a scaffold 
peptide. The stably folded structure of the scaffold 
enhances and/or enables binding of the random peptide to 
the target protein. Binding of the random peptide- 

25 scaffold peptide fusion to the target protein of interest 
is monitored by a two-hybrid selection method (Fields, S. 
et al. (1989) , supra) . 

In general, the scaffold is a peptide sequence 
that stably folds into a 3-dimensional structure and 

30 which, when fused to a random amino acid sequence of the 
invention, enhances presentation of the random sequence 
to the target protein. Specific embodiments of the 
scaffold peptide include nucleotide sequences which 
encode and allow translation of an Adh C-terminus (SEQ ID 
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NO:3); the Bl domain of G protein (Gronenborn, A.M. 
(1991) Science, 252:657-661); a zinc finger domain (Lee, 
M.S. (1989), Science, 245:635); the B domain of proteinA 
from S. aureus (Gouda et al. (1992) Biochemistry 3J..-9665- 
5 9672); the tenth type III module of fibronectin (Baron et 
al. (1991) Biochemistry Al: 2068-2073) ; and cyclophilin 
(Ke et al. (1991) PNAS USA fifi: 9483-9487 ) . The random 
peptide can be linked at either end, or preferably in the 
middle, and most preferably in external loops of the 
10 scaffold, making the random peptide accessible to the 
target protein for binding. 

The invention further features a non-natural 
fusion protein, rip (B ev interacting E rotein, SEQ ID 
NO:2), isolated by expressing a non-naturally translated 
is peptide fused to a scaffold peptide domain, which fusion 
peptide binds to the HIV-1 Rev protein and inhibits its 
function. 

In addition, the invention features a 
substantially pure DNA sequence (SEQ ID N0:4), referred 
20 to herein as the »R32 nucleotide sequence," and the 

deduced amino acid sequence (SEQ ID NO:5). This sequence 
does not include any polylinker or scaffold sequences. 

The invention also features a Rev protein binding 
protein encoded by a DNA sequence of SEQ ID NO: 4 linked 
25 to a scaffold peptide-encoding sequence. 

The invention also features a vector containing, 
in the order of transcription, a nucleotide sequence 
encoding one member of a two-hybrid binding peptide pair 
(i.e. a transcriptional activation domain (Gal4 or Herpes 
30 virus VP16, for example); a nucleotide sequence encoding 
a random peptide sequence to be tested; and a nucleotide 
sequence encoding a scaffold peptide domain. 

In addition, the invention features a second 
vector containing, in the order of transcription, a 
35 nucleotide sequence encoding a DNA binding domain (Gal4 
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or LexA, for example) ; and a nucleotide sequence encoding 
a target protein. It is understood by one of ordinary 
skill in the art that the transcriptional activation 
domain or the complementary DNA binding domain of a tvo- 
5 hybrid binding peptide pair can be varied in position 
with its respective fusion protein as necessary to 
improve transcriptional activation* 

One of the vectors of the two-hybrid system 
contains a nuclear localization signal to allow 

10 translocation of the complex of interacting fusion 
proteins into the nucleus for interaction with the 
transcriptional activation sequences of the marker gene. 
It is also noted that vectors used in selecting a random 
peptide in the context of a scaffold peptide domain can 

is encode the random peptide 5' to the scaffold peptide 
domain (in the order of transcription) , or the random 
peptide-encoding sequence can be inserted into the coding 
sequence of the scaffold peptide domain. 

The invention also features an antibody to the RIP 

20 protein. The antibody may be a monoclonal antibody or a 
polyclonal antibody. For use in assays of the invention, 
the anti-RIP antibody, or the RIP itself, may also be 
labeled for detection such as with a fluorescent moiety 
covalently attached to the antibody, or peptide, without 

25 interfering with antibody binding to the RIP. 

In addition, the invention features an assay for 
the qualitative or quantitative detection of Rev protein 
in a cellular lysate, by inducing a population of cells 
containing or suspected of containing HIV to proliferate, 

30 thereby inducing HIV to replicate and produce Rev 

protein. The cells are lysed and the proteins of the 
lysate are contacted with RIP, e.g., immobilized on a 
solid support, to allow binding of Rev. The presence of 
the RIP-Rev complex is indicated by a decrease in the 
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detection signal observed when Rev is present compared to 
a control sample having no Rev protein. 

The invention further features a method for 
screening candidate agents that inhibit HIV replication 
5 by inhibiting Rev production. Human cells containing 

actively replicating HIV are contacted with the candidate 
agent in an appropriate culture medium. As measured, 
e.g., by the RIP/anti-RIP antibody assay, a decrease in 
the cellular concentration of Rev in treated cells 
10 compared to a control without the candidate agent, 
indicates that the agent is potentially useful for 
inhibiting HIV replication in human cells. 

The invention also features a method of inhibiting 
HIV reproduction in lymphocytes of a mammal (preferably a 
is human) by introducing DNA encoding RIP into the 
lymphocytes where the DNA is expressed and RIP is 
produced in amounts sufficient to inhibit HIV 
reproduction. Moreover, the invention features a method 
of inhibiting the production of HIV-1 in human cells, 
20 e.g., therapeutically or prophylactically , by 

administering the RIP protein to the cells using gene 
therapy techniques. 

By "random peptide" or "random amino acid 
sequence," is meant any peptide that does, or does not, 
25 naturally exist in a cell. For example, a random peptide 
according to the invention can be generated by the 
translation of a nucleotide sequence that does not encode 
a naturally occurring full-length protein, or by 
synthetic methods. The nucleotide sequence encoding a 
30 random peptide is encoded, for example, by a cDNA 

sequence isolated by reverse transcription of a cellular 
sense or antisense mRNA transcript, or by a synthetic 
oligonucleotide containing amino acid codons linked in 
random order. Examples of a nucleotide sequence encoding 
35 a random peptide include, but are not limited to: 1) 
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cellular antisense mRNA reverse transcribed into cDNA 
then expressed as a polypeptide which binds to a target 
protein of interest, and 2) a polypeptide generated by 
the expression of a nucleic acid sequence encoding a 
5 randomly generated amino acid sequence. 

A "stably folded peptide" is a peptide that is 
resistant to denaturation, e.g., thermal denaturation, in 
a cell. 

By "reduced HIV-1 protein production in cells 

10 expressing the virus" is meant a decrease in the 

expression of, for example, essential proteins Gag, Env, 
and Pol in RIP-containing cells compared to the normal 
expression in the same cells lacking RIP. 

By "sufficient concentration of HIV-1 Rev binding 

15 peptide" is meant a concentration of RIP in a cell above 
the threshold necessary to inhibit translocation the Gag, 
Env, and Pol mRNAs from the nucleus into the cytoplasm. 

By "substantially pure random peptide-scaf fold 
peptide fusion protein" is meant that the random peptide- 

20 scaffold peptide fusion protein provided by the invention 
is at least 60%, by weight, free from the proteins and 
naturally-occurring organic molecules with which it is 
naturally associated. Preferably, the preparation is at 
least 75%, more preferably at least 90%, by weight, 

25 random peptide-scaf fold peptide fusion protein. A 

substantially pure random peptide-scaf fold peptide fusion 
protein may be obtained, for example, by expression of 
the recombinant nucleic acid of the library test vector 
encoding it, or by chemically synthesizing the protein. 

30 Purity can be measured by any appropriate method, e.g., 
column chromatography, polyacrylamide gel 
electrophoresis, or HPLC analysis. 

By "substantially identical amino acid sequence" 
is meant an amino acid sequence that differs from another 

35 sequence only by conservative amino acid substitutions, 
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for example, substitution of one amino acid for another 
of the same class (e.g., valine for glycine or arginine 
for lysine) , or by one or more non-conservative amino 
acid substitutions, deletions, or insertions at positions 
5 of the amino acid sequence that do not destroy the 
biological activity of the random peptide-scaf fold 
peptide fusion protein. 

By "operably linked" means that selected DNA is in 
proximity with transcriptional and trans lational 

10 regulatory sequences to allow these sequences to regulate 
expression of the selected DNA. 

Unless otherwise defined, all technical and 
scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art 

15 to which this invention belongs. Although methods and 
materials similar or equivalent to those described herein 
can be used in the practice or testing of the present 
invention, the preferred methods and materials are 
described below. All publications, patent applications, 

20 patents, and other references mentioned herein are 

incorporated by reference. In addition, the materials, 
methods, and examples are illustrative only and not 
intended to be limiting. 

Other features and advantages of the invention 

25 will be apparent from the detailed description, and from 
the claims. 



ftrj yf Descri ption of tne Drawings 
Fig. 1 is a diagram of the results of HIV antigen 
production and Rev binding experiments comparing 
30 characteristics of wild type Rev protein and various 

mutant Rev proteins. Data for HIV-1 gtat and P 24 antigen 
production were obtained from Malim, et al. (1991), 
supra. RIP-Rev binding experiments were performed by the 
method of the invention. 
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Fig. 2 is a diagram of R32 clones used to evaluate 
regions necessary for Rev binding activity. 

Fig. 3A is a diagram showing various N-terminal 
deletions of the R32 nucleotide sequence starting at the 
5 EcoRI site. The Xhol site was filled in with a- 
thionucleotides to achieve unidirectional deletions 
according to standard methods (Henikoff (1984) Gene, 
2&:351-359) . 

Fig. 3B is a diagram showing the structure of the 

10 initial test vector construct and of active test vector 
constructs after bidirectional exonuclease III deletions. 
The deletions were started from the Xbal site. 

Fig. 4 is a representation of the nucleotide and 
deduced amino acid sequences of the Rev interacting 

15 protein, RIP. The RIP coding sequence consists of the 

downstream 175 nucleotides of the originally isolated R3 2 
sequence (from the human MCM2 homologue in antisense 
orientation) linked to DNA linker sequences from the 
cloning vector cDNA f 15 nucleotides from the yeast ADH 

20 gene, and 108 nucleotides encoding 3 5 amino acids plus a 
stop codon (SEQ ID NO: 3) from the Adh C-terminus. The 
shaded portions have been found to be dispensable for 
binding to Rev in C-terminal deletion experiments, but 
were included for cloning convenience in the experiments 

25 expressing RIP in mammalian cells. 

Fig. 5 is a graph showing the inhibition of HIV-l 
gag p24 antigen production by increasing amounts of RIP- 
encoding DNA introduced into HeLa cells by cotransf ection 
with HIV-l infectious molecular clone, pNL4-3 (Adachi, 

30 A. et al. (1986) J . Virol . £2:284-291). 

Figs. 6A and 6B are graphs showing the inhibition 
of HIV production in CEM lymphocytes by RIP protein (Fig. 
6A) ; and showing that RIP-induced inhibition is specific 
to HIV-l Rev since RIP does not affect expression of 

35 another protein, alkaline phosphatase, in the CEM 
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binding domain fused to a protein X; and 2) an activatior 
domain fused to a protein Y. When both hybrid proteins 
are present, the noncovalent interaction of X and Y 
bridges the interaction of the DNA binding domain with 
5 the DNA binding site, and the interaction of the 
activation domain with the transciptional machinery, 
thereby activating transcription of the reporter gene. 
In the general assay, to select a protein Y, that binds 
to a target protein X, Y is encoded by a pool of plasmids 

10 in which total cDNA or genomic DNA is ligated to the 
activation domain (Fields, S. and Sternglanz, R. (1994) 
Trends in Genetics, lfl:286-292). 

The following examples provide support for the 
discovery that a scaffold peptide domain fused to a 

15 random peptide enhances binding selection of the random 
peptide in a host cell, e.g. , a yeast cell. 

Examples 

Examp l e l ; A Method of Isolating a Random Pep tj<^- 

SCflffQld Fusion Protein That Bin ds a Specific Targe t 

20 Protein 

Selection of a random peptide in the "two-hybrid" 
assay, is limited by the ability of the random peptides 
to fold correctly within the cell (Fields, S. and 
Sternglanz, R. (1994) Trends in Genetics, 1£ : 286-292 ) . 

25 Most random peptide sequences are unlikely to fold into a 
stable compact protein. As described in the examples, 
the unexpected finding that a Rev interacting protein 
required an Adh scaffold sequence (SEQ ID NO: 3) for 
binding demonstrates a need for a stably folded scaffold 

30 peptide to be fused to the random amino acid sequence to 
be selected. 

To provide a useful scaffold peptide domain, the 
Adh transcription termination sequence was expressed as a 
read-through sequence which accompanied transcription of 
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10 



the random amino acid sequence to be selected by the 
method. The observed binding requirement for the Adh 
transcription termination sequence indicated that the Adh 
sequence was necessary as a molecular scaffold that 
stabilized the entire fusion protein, and thus the random 
peptide. Thus, the 3 5 amino acids, 

LVTCSPXKWGLSTI^EIYEKMEKGQIVRYVVDTSK (SEQ ID NO: 3) from 
the Adh transcription termination sequence are required 
to be present in the expressed fusion protein to serve as 
the scaffold for the purpose of structural presentation 
of a fused random peptide in a two hybrid selection 
system* 

Another scaffold protein sequence useful in the 
method of the invention is the Bl domain of the 
15 streptococcal protein G. This protein domain is a single 
immunoglobulin binding domain, and the folded structure 
has been extensively studied (Gronenborn, A.M. et al. 
(1991) ScIbticg, £52:657-661) , but never used in the two- 
hybrid system. The Bl protein domain forms a stably 
folded structure which is not denatured below a 
temperature of 87*C, exhibits reversible thermal 
denaturation, and maintains its native conformation in 
aqueous solution containing up to approximately 8 M urea. 

DNA encoding a random peptide is preferably 
inserted between amino acids 9 and 12 of the Bl domain at 
the site of a loop between two beta sheets (see 
Gronenborn, A.M. (1991), supra, for Bl structural 
analysis) , thereby providing optimal presentation of the 
random peptide for binding to the target protein. The 
transcriptional activation domain is fused to the N- 
terminus of the B 1 doma in . 

Other scaffold peptides described above can be 
used in a similar fashion. 

In general, libraries of test vectors are 
35 constructed in which each vector contains, in the 



20 



25 



30 



WO 96/23899 



PCT,X'S95/09589 



- 14 - 



direction of transcription, a transcriptional activation 
domain, a DNA sequence encoding a random amino acid 
sequence, and a DNA sequence encoding a scaffold peptide 
such as the Adh C-terminal sequence (SEQ ID NO: 3) or the 
s Bl domain of Streptococcal protein G (Gronenborn, A.M. 
(1991) Science, 252:657-661). A target vector is also 
constructed which contains, in the order of 
transcription, the Gal4 DNA binding domain linked to the 
coding sequence of a target protein of interest, e.g., 
10 the entire Rev coding sequence. 

The relative position of the target domain and the 
Gal4-DNA binding domain can be reversed and still 
generate a functional target fusion protein. Also, as 
noted supra, other binding domains such as LexA can be 
15 used in the assay. in addition, the yeast expression 
vectors of the invention contain appropriate 
transcriptional and translational regulatory sequences 
and nuclear localization signal coding sequences for 
expression and nuclear localization of the test and 
20 target fusion proteins. 

The interaction of the test random amino acid 
sequence and the target domain in a cell provides a 
functional GAL4 transcriptional activator resulting in 
the expression of a marker gene (e.g. lacZ) under the 
25 transcriptional control of a GAL 4 promoter. A positive 
interaction is detected as blue yeast colonies on yeast 
culture solid medium containing X-Gal. 

As the length of the random amino acid sequence 
increases, the number of possible random fusion proteins 
30 increases exponentially. For example, a random 20 amino 
acid sequence can have any one of 2 0 naturally occurring 
amino acids at each position, or 20 20 different sequences. 
The Rev effector domain as an example of a target domain 
is likely to bind multiple sequences, increasing the 
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in the art (HeLa s/atccTc ^T" " 
The HuLa cDNA library vector DCP15B 
"> constructed as follows. The Ja»Hi / 

(BecKer et al. U991) - ™\ £ ™*»ent for DB20 

fining an Jh prooMer and ^' 1968-1972 > 
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adapter, was inserted into the BstXI sites of pCF128 to 
generate the test library. 

The library contained approximately 4xl0 6 
independent clones. This library was transformed into 
5 the yeast strain GGY1 (Gill, G. et al. (1987) Cell 

5^:121-126) containing an integrated lacZ reporter under 
GAI*4 control and expressing a Gal4 (1-14 7) -Rev fusion. 

From approximately 2xl0 6 yeast transf ormants, 27 
primary positive clones were recovered. Further 
10 screening of the primary clones eliminated false positive 
clones. Candidate plasmids were retransf ormed into GGY1 
together with constructs expressing either the Gal4 DNA 
binding domain Gal4 (1-147) alone, the Gal4 fusion Gal4- 
Rev, or the fusion Gal4-Rev(M10) (Table 1) . 

Class of Clones 
I II III 

Constructs 

Gal4 (1-147) 
20 Gal4-Rev 

Gal4-RevM10 
Number of isolates: 

Of the 27 candidates, 12 library plasmids 
activated transcription when transformed into yeast 

25 together with either one of the Gal4-Rev fusions, but not 
together with Gal4 (1-147) (Class II clones). These 
cDNAs presumably encode proteins that bind to parts of 
the Rev protein that are not affected by the M10 mutation 
in the effector domain. Only one of the plasmids (R32) 

30 was able to discriminate between wild-type Rev and Rev- 
M10, causing reporter gene activation together with Gal4- 
Rev, but not Gal4-Rev (M10) (Class III clone). This 
indicates a specific binding of the protein product of 
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R32 to the effector domain of Rev. The R32 plasmid 
contains a 377 bp cDNA fragment inserted into vector 
PCF128 (supra). The expressed fusion protein contains 
the VP16 activation domain, the R32-encoded peptide, and 
s the Adh scaffold peptide and was determined to be a Rev 
interacting protein in subsequent binding studies. 

Example 3: .Sp pcj f j c i ty of the Rev int^^ lon nr-o^ in - 
Rev Bind^ T 

To further assay the binding specificity of the 
io isolated Rev interaction protein to Rev, the R32 plasmid 
was transformed into yeast together with Gal4 fusions to 
a number of different Rev mutants that have previously 
been described (Malim et al. (1991), supra). Binding of 
RIP to Rev wild-type and mutants was assayed by the 
as method of the invention. The production of HIV-i gtat 
and HIV-l gag p24 assayed by Malin et al. (Malim et al 
(1991), supra) is tabulated in Fig. i to illustrate the 
activity of the Rev mutants relative to wild type Rev. 
All of the mutants tested contain mutations in the Rev 
effector domain between amino acids 78 and go, but not 
all of these mutations abolish Rev function (Fig. l) . 

A high correlation exists between the function of 
these Rev mutants and their ability to bind to the rip 
protein encoded in part by the original R3 2 sequences 
25 (Fig. i) . Mutants M20 and M25 that retain some Rev 
activity according to Malim et al. (1991), supra, bind 
RIP; whereas M10, and the single point mutants M27 and 
M29 that have no Rev activity did not show any detectable 
binding to RIP. Mutant M22 does not show a correlation 
30 between RIP binding and Rev function and this may be due 
to alteration of the amino acid and its charge at 
position 83 when the amino acid is changed from Leu to 
Asp. 
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Examp l e 4; The R32 Sequence of rip is Fncoded on th ? 

Ant i sense Strand Pf a Human MCM 2 Hanm^ 

When the R32 sequence was compared with sequences 
in the databank (Genbank) there was no exact match, 
however some amino acid sequence homology was found on 
the antisense strand of R32 with the yeast MCM2 protein 
(Yan, H. et al. (1991) Genes and Davelopment , 5:744-957). 
A human AcDNA clone (AcDNA library #93 6201, Stratagene, 
La Jolla, CA) corresponding to this region showed the R32 
sequence on the antisense strand of the hMCM2 homologue. 
A Northern blot probed with a strand-specific RNA probe 
to detect messages encoding the R32 open reading frame 
yielded no signal above background signal; whereas the 
same blot hybridized with a randomly radiolabeled R32 
15 probe (i.e. both sense and antisense strands were 
radiolabeled) detected one moderately abundant 
transcript of 4.3 kb, corresponding in size to the human 
MCM2 homologue. These findings indicate that the R3 2 
sequence is not part of a cellular protein coding 
transcript, but rather represents a sequence selected 
from the 'random' sequence pool of transcripts in 
antisense orientation present in the cDNA library. 

To confirm this hypothesis, fusion proteins 
expressed by various constructs were tested for Rev 
25 interaction (Fig. 2) using the two-hybrid assay. Becker 
et al. reported that the Adh terminator contains a 
cryptic promoter capable of transcribing upstream 
sequences (Becker, D. M. et al. (1991) PSAS £1:1968- 
1972) . Fusion of the human MCM2 gene in frame with the 
30 VP16 activation domain did not result in transcriptional 
activation in yeast when cotransf ormed with the Gal4-Rev 
construct (Fig. 2, construct 3). Thus, the R32 fusion 
protein, and not the hMCM2 homologue, is responsible for 
Rev binding. 
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Surprisingly, introduction of , stop codon 
directly after the M2 sequence also abolis „ ed 
binding , rig . 2 , construct /2) closer 4 

sequence revealed that in the original clone there was no 
• stop codon after the cloned COHA , R32 , , adding to the 
open reading frame l6 amino acids fro. poiyliLr 
sequences and 35 amino acids from the c-ter»inus of the 

used in the library test vector. This was confix « 

10 ^rr et /s (pig - " uhe - R22 °™ ™™» - 

"tisense strand of the human HC M 2 open reading frame 

stop rr a R32 " adh fusion uas oi °"' d °< ^e 

«op C od=„ „ clone « (Pi,. 2) , . £u „ ctional 
is again ootained <rig. 2 , construct «, . A fI J e 

construct (Fig. 2 , construct /<). 

Thus, the functional Rev interaction protein 

»l ::;tis r^r £rom p ° iyi1 ^ - d 

16 Flg - The fact that vector 

sequences were required for the interaction between R3 2 - 
encoded amino acids and Rev indicates that R32 waTnot 

2 s pt p e tL h e yS s i0l ° 9iCal ™ ^ ~* " ^n S ts n0 a 

peptide sequence selectable when fused to a scaffold 
Peptide domain, the Adh c-terminal sequence. 

fiXflptple -ji; Delineat ion ^ 

Reouir-^ frr R ^„ p 1nr11||IJ 

To determine the minimal amino acid sequence 
30 required for Rev bindina N - ann r> «. ^ 

j-naing, N- and C-termmal Exolll 

deletxons were performed using construct / 5 (Fig 2 > 
which en coded a functional Rev interaction protean ' 

TtZrlTi: R32 -T SC3ff0ld fUSi0n) ' Pl3Smid or 
different tame points fro* unidirectional Exolll deletion 
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experiments were cotransf ormed into yeast with the Gal4- 
Rev target vector. Plasmids were rescued from positive 
yeast colonies. The shortest functional peptide 
recovered from an N-terminal deletion series had 203 of 
5 the 377 bp of the original R32 sequence deleted. The 
construct was transformed into yeast strains expressing 
Gal4( 1-147), Gal4-Rev, and Gal4-RevM10 to assess binding 
specificity. The N-terminal deletion peptide not only 
retained full discrimination between Rev and RevMlO but, 

10 judging from the levels of £-Gal produced by these 

strains , the interaction between the N-terminal deleted 
Rev interaction peptide and Rev was stronger than the 
interaction detected with the original Rev interaction 
protein isolate. 

is To determine the number of C-terminal amino acids 

that could be deleted from the original Rev interaction 
protein isolate without loss of Rev binding activity, a 
similar strategy of ExoIII deletions was used. These 
manipulations were again performed with clone #5 (Fig. 

20 2) . Surprisingly, the active constructs from late 

deletion time points all showed the same structure: the 
polylinker and the Adh domain upstream of the stop codon 
had been deleted. Also, all of the recovered active 
clones no longer had the stop codon and were now in frame 

25 with the downstream Adh domain. Since, in these 

experiments, no active clones were recovered which lacked 
the downstream Adh domain fused in frame with upstream 
R3 2 sequences, conclusions regarding a minimal active 
protein can be drawn. Only one amino acid can be deleted 

30 from the C-terminus of the R32 sequence, and the 

polylinker sequences are dispensable, whereas all 35 
vector-encoded amino acids (SEQ ID NO: 3) originating from 
the yeast Adh protein are necessary for an interaction 
with Rev. The minimal amino acid sequence, RIP (£ev 

35 interacting protein, SEQ ID NO:2), is shown in Fig. 4. 
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The original R32 clone was 377 nucleotide base 
pairs in length and encoded 125 amino acids. The two- 
hybrid library test vector polylinker added 16 amino 
acids and the last 35 amino acids were contributed fr. 
the Adh terminator in the expression cassette. Deleti_.. 
analysis showed that 67 of the unique 125 amino acids as 
well as the polylinker sequences could be deleted from 
the construct without losing Rev binding ability. The 
minimal protein sequence therefore consists of 58 amino 
acids encoded by the hMCH2 antisense fragment fused to 
the Adh scaffold of 35 amino acids (SEQ ID NO: 3) of the 
Adh C-tenainus . 



Examp l e 6; RIP Expression Tnhi Mts htv v^al Pt-oJ-^ 

Expression 

Rev is essential for HIV replication in human 
cells and the Rev effector domain is required for its 
function (Malim, M. et al. (1989), supra). The RIP 
peptide, containing a portion of a cDNA isolated from the 
random pool of normally untranslated messages fused to 
-he Adh-derived scaffold, was tested to determine if its 
*ev binding ability correlated with an ability to inhibit 
*ev function. 

For these tests, the HIV-l infectious molecular 
:lone, pNL4-3 (Adachi, A. et al. (1986) J. Virol. 5_9_:284- 
91), which directs the expression of infectious HIV-l 
irions following transfection into a variety of cell 
ypes, was used as the source of HIV in human cells, 
reliminary experiments were performed in HeLa cells 
HeLa S3 ATCC #CCL 2.2) . HeLa cells were grown in 
onolayers in DMEM media (GIBCO, Grand Island, NY) 
upplemented with 10% fetal bovine serum. Cells were 
otransfected with pNL4-3 and either a CMV 
romoter-driven expression vector (pcDNAl, Invitrogen, 
in Diego, CA) or the identical expression vector 
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containing the RIP cDNA (SEQ ID NO:l) (pcRIP) cloned into 
the tfindlll and EcoRI sites of pcDNAl . 

Transfections were performed on monolayer cells by 
introducing 2 0 M g of total plasmid DNA by the calcium 
5 phosphate procedure (Chen C. et al. (1988) Biotschniques , 
4:632-638) using 5 nq of LTR-CAT (chloramphenicol 
acetyltransf erase) reporter construct, 5 to 15 »q of test 
plasmid DNA, and carrier DNA. Suspension cultures were 
transfected with DEAE-dextran (Margolis et al., (1993) j. 
10 Virology, 122:370-374) using 4 ng of pNL4-3 and 4-12 M g 
of test plasmid DNA. To assay HIV-i replication, 
aliguots of culture media were sampled for detection of 
HIV-l gag P 24 protein (Veronese, et al. (1985) P.N.A.S. 
OSA, fii: 5199-5202). Assays were performed using an 
is antigen-capture ELISA kit (Coulter, Hialeah, FL) . 

Inhibition of HIV-l production increased with 
cotransfection of increasing amounts of rip expression 
vector relative to a constant amount of P NL4-3 HIV-l 
molecular clone (Fig. 5) . Expression of a co-transf ected 
HIV LTR-driven CAT reporter gene in these cells was 
unaffected by RIP expression, demonstrating that 
inhibition of virus production was not due to 
transcriptional effects, or non-specific toxicity. 



20 
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Example ?: — rip inhibits HIV ProdMr t ion in T.Y » rh »^^^ 

As lymphocytes are a primary target of HIV-l 
infection in vivo, further experiments were performed in 
lymphocyte cell lines. m CEM cells, cotransfection of 
RIP-encoding DNA at a transfection ratio of 2:1 (4 M g 
PNL4-3 and 8 nq pcRIP) resulted in inhibition of HIV-l 
30 antigen production up to twenty- fold greater than that 
mediated by cotransfection of the dominant negative Rev 
mutant M10 (Fig. 6A) . Previous studies demonstrating 
potent inhibition by Rev M10 in similar cotransfection 
studies have used transfection ratios of 10:1 (Malim et 
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al. (1989) , supra) • Thus, RIP is a more potent inhibitor 
requiring a two-fold excess (2:1) of RIP to inhibit 
whereas a 10-fold excess (10:1) of M10 is required for 
inhibition of HIV-1 antigen production. Cell viability 
5 and cell growth were unaffected by RIP expression as 
tested using well known techniques of cell counting and 
trypan blue assay, Cotransf ection of a secreted alkaline 
phosphatase reporter gene (pCMU/SEAP) (Tropix Inc., 
Bedford, MA; Cullen, B. et al. (1992) Meth. Enzyme, 

10 362-368) showed that RIP did not impair the ability 

of these cells to express transfected genes (Fig. 6B) . 
Decreased expression of alkaline phosphatase at later 
time points in cells cotransf ected with Rev M10 or pcDNAl 
may reflect cell killing by HIV-1. Similar 

15 cotransfection experiments performed in H9 lymphocytes 
also demonstrated the ability of RIP to inhibit virion 
production in a dose dependent manner (Fig. 7) . 

T cell lines, CEM and H9 were maintained in 
suspension culture in RPMI 1640 media (ICN Biochemicals , 

20 Cosa Mesa, CA) supplemented with 10% fetal bovine serum 
(GIBCO, Grand Island, NY) . 

Fvamnle 8: RIP Inhibition of HIV Is Not Due t<? 
Antisense-Media ted Inhibition of MCM2 Expression 

Since most of the RIP peptide has been shown here 

25 to be encoded by the antisense strand of a human MCM2 
homologue, an important control is to show that the 
observed inhibitory affect on HIV-1 production was not 
due to antisense transcript-mediated inhibition of 
endogenous hMCM2 . The yeast MCM2 protein has been shown 

30 to be involved in plasmid maintenance (Yan f H. et al. 

(1991) Genes and Development, £:944-957) , and is thought 
to be involved in the initiation of DNA replication 
(Yan H. et al. (1991), supra). A frame shift mutation 
was generated directly after the ATG in the RIP 
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expressing vector. This plasmid therefore retains all of 
the antisense sequence, however it is not able to express 
a Rev binding peptide. Fig. 7 shows that such a 
construct (pc ARIP) has no antiviral activity. 

5 Example ?; Assay to Detect the Presence and/or Quantity 

Pf Rev in a Cell Lvsate Using RIP and Anti-RTP Antibody 

The RIP of the invention as well as anti-RIP 
antibodies are useful in an assay for the presence of Rev 
in cells of a patient. Thus, the assay is useful as a 
10 diagnostic tool to determine whether cells (e.g. 
lymphocytes) of a patient contain the HIV virus. 

To determine the presence of actively replicating 
HIV in a cell population, lymphocytes of a human 
suspected of being infected with HIV-1 are collected, 
is suspended in an appropriate culture medium to induce 

cellular and viral replication and induce expression of 
viral proteins such as Rev. The cells are then lysed and 
the supernatant containing Rev is contacted with RIP 
immobilized on the surface of the well of a standard cell 
20 culture dish as shown in Fig. 8 (Steps A and B) . After 
specific binding of Rev to the immobilized RIP , the 
remaining, unbound proteins of the lysate are removed by 
washing (Fig. 8, Step C) . Next, an anti-RIP antibody is 
contacted with the proteins in the cell culture well 
25 (Fig. 8, Step D) . The anti-RIP antibody binds to the 
remaining free RJP peptides. When Rev protein is not 
present in the lysate, the signal is high. When the 
lysate contains Rev, the signal is reduced due to a 
decrease in the number of accessible RIP proteins in the 
30 well. 

The anti-RIP antibody can be directly labelled 
with a detectable tag such as radiolabelling with 125 I or 
a covalently bound fluorescent moiety, e.g., fluorescein. 
Further, unlabeled anti-RIP antibodies can be used, and 
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the signal generated by the secondary binding of a 
labeled antibody to anti-RIP antibody. Further, it is a 
feature of the invention that, under the conditions of 
the assay, the anti-RIP antibody binds RIP with reduced 
5 affinity relative to the binding of Rev to RIP - Thus, 
the Rev protein remains bound to RIP during the assay for 
reliable qualitative or quantitative determination of the 
presence of Rev in cells. 

The assay is also useful in screening candidate 

10 agents that inhibit viral production by inhibiting the 
expression of essential viral proteins such as Rev. 
According to this embodiment of the invention, a cell 
culture containing actively replicating HIV is contacted 
with a candidate agent in an appropriate culture medium. 

15 The cells are assayed as above for Rev protein. 

Successful agents are those which reduce the amount of 
Rev protein by at least approximately 90% in the treated 
cells relative to untreated control cells. 

Standard protocols are followed to carry out an 

20 assay method of the invention. For example, after 

bringing the immobilized RIP into contact with a cell 
lysate, the surface is washed to remove unbound material. 
Further, standard blocking protocols are generally used 
prior to contacting the washed surface with labeled 

25 antibodies. An alternative to using cells infected with 
HIV is to provide acutely virus infected cells. For 
example, lymphocytes are obtained from uninfected persons 
and infected with a defined amount of HIV. The cells are 
then treated to remove input virus and cultured. 

30 Isolation of RIP of other selected random peptide- 

scaffold fusion proteins and production of antibodies to 
them are performed using standard techniques well known 
to those of ordinary skill in the art. The selected 
random peptide-scaf f old peptide of the invention can be 

35 produced by first transforming a suitable host cell with 
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the nucleotide sequence encoding the random peptide- 
scaffold fusion protein cloned into a suitable expression 
vehicle followed by expression of the desired protein or 
polypeptide . 

5 Those of ordinary skill in the field of molecular 

biology will understand that any of a wide variety of 
expression systems can be used to provide the protein or 
polypeptide. The precise host cell used is not critical 
to the invention. The polypeptide can be produced in a 
10 prokaryotic host (e.g., E. coli) or in a eukaryotic host 
(e.g., SACch&romycBs cerevisi&&) . The method of 
transformation of the cells and the choice of expression 
vehicle will depend on the host system selected. Methods 
described herein provide sufficient guidance to 
15 successfully carry out the production, purification and 
identification of RIP or other selected random peptide- 
scaffold fusion protein. 

Once the RIP or other selected random peptide- 
scaffold fusion protein is expressed, it is isolated, 
20 e.g., using immunoaf f inity chromatography. In one 

example, an anti-RIP antibody can be attached to a column 
and used to isolate RIP. Lysis and fractionation of RIP- 
containing host cells prior to affinity chromatography 
can be performed by standard methods. Once isolated, the 
25 recombinant protein can, if desired, be further purified, 
e.g., by high performance liquid chromatography (see, 
e.g., Fisher, Laboratory Techni ques In Biochemistry and 
Molecular Biology, eds., Work and Burdon, Elsevier, 
(1980)). RIP or other peptides can also be produced by 
30 chemical synthesis by standard solution or solid phase 
peptide synthesis techniques. 

Substantially pure RIP or other random peptide- 
scaffold fusion protein can be used to raise antibodies. 
Antibodies directed to the polypeptide of interest are 
35 produced as follows. Peptides corresponding to all or 



WO 96/13899 



PCT/US95/09589 



- 27 - 



15 



part of the polypeptide of interest are produced using a 
peptide synthesizer by standard techniques, or are 
isolated and purified as described above. The peptides 
are coupled to KLH with m-maleimide benzoic acid N- 
5 hydroxysuccinimide ester. The KLH-peptide is mixed with 
Freund's adjuvant and injected into animals, e.g. guinea 
pigs or goats, to produce polyclonal antibodies. 

Monoclonal antibodies can be prepared using the 
polypeptide of interest described above and standard 
io hybridoma technology (see, e.g., Kohler et al., Nature 
(1975) 2££:495; Kohler et al., Eur. J. Immunol. (1976) 
6:292; Kohler et al., Eur. J. Immunol. (1976) 6:511; 
Hammer ling et al., in Monoclonal Anti bod ies and 7 ] 
HVbr i flPCTflg , Elsevier, NY, (1981), which are incorporated 
herein by reference) . Antibodies are purified by peptide 
antigen affinity chromatography. 

Once produced, antibodies to RIP, for example, are 
tested for specific RIP binding by Western blot or 
immunoprecipitation analysis by standard techniques. 

20 Uses 

The method of isolating a random peptide-scaf f old 
peptide fusion protein is useful to discover peptides 
that bind specifically to selected target proteins. As 
described above, the binding of the random peptide is 

25 enhanced by the 3-dimensional structure assumed in the 
context of the stably folded scaffold peptide. 

The fused scaffold structure is also useful to 
determine the 3-dimensional structure of the random 
peptide such that small organic molecules can be designed 

30 to mimic some of its structural features. These small 
organic molecules are useful as potential therapeutic 
agents where binding to the target protein affects its 
function, e.g., inhibition of Rev protein. 
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The isolated random peptide-scaf f old fusion 
protein RIP is useful as an inhibitor of HIV production 
in cells, such as human lymphocyte cells. RIP and 
antibodies to RIP are useful in an assay for the presence 
5 of Rev in a lysate of cells suspected of being infected 
with HIV. 

RIP can be used to treat or prevent HIV infection 
by introducing a DNA sequence (SEQ ID NO:l) encoding RIP 
(SEQ ID NO: 2) , or Rev interacting analogues of the 
10 original R32-scaffold fusion, into the genome of a human 
cell infected with, or likely to be infected with, HIV. 
RIP is expressed in the cell at sufficient concentration 
within the cell to inhibit Rev function and, thereby 
inhibit HIV replication. 
15 Cells which are transfected with RIP-encoding DNA 

sequence include cells that can be infected by HIV such 
as CD4 + expressing cells. In one method, bone marrow 
stem cells from an infected patient are removed from the 
patient and then infected with a an appropriate viral 
20 vector containing RIP-encoding DNA sequence by standard 
gene therapy techniques well known in the art (see e.g., 
Morsy, M. A. et al. (1993) J AM A 2JJ2. :2338-2345) . The 
patient's remaining bone marrow is preferably destroyed 
by irradiation or chemical ablation to eliminate any HIV- 
25 infected stem cells that do not express RIP. The 

transfected stem cells are reintroduced into the host 
from which they were isolated (autologous 
transplantation) by injection into a vein of the host 
(see, Biron, P. et al. (1993) in Autologous Bone Marrow 
30 Transplantation; Proceedings of the First International 
SYPPPgjyffi, Dicke, K.A. et al. (eds.), p. 203 for relevant 
techniques) . 

The transfected stem cells are allowed to 
repopulate the bone marrow of the patient and provide 
35 CD4 + cells expressing RIP which inhibits HIV replication 
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in these cells when infected with the virus. This 
results in simultaneously treating and protecting the 
host from disease resulting from HIV infection and 
replication. Allogenic transplantation of stem cells 
5 expressing RIP from an uninfected host is another 
embodiment of the invention and provides the same 
protection to transfected cells and their progeny from 
subsequent HIV replication. 

Another use for a random peptide isolated by the 
10 method of the invention involves analyzing the structure 
of the random peptide in the context of the scaffold 
peptide for designing small molecule therapeutic agents. 
First, DNA encoding a random peptide is inserted into the 
coding sequence of the scaffold peptide, e.g., the Bl 
15 domain of G protein. In this context, the random peptide 
is displayed on the surface of a stable protein and 
therefore is less likely to be degraded. Also the 
structural constraints of the scaffold may maintain the 
peptide in an active conformation. The structure of the 
20 random peptide necessary to interact with the target 
protein is deduced by comparing the 3 -dimensional 
structure of the scaffold peptide domain with and without 
the inserted random peptide sequence, e.g., by using NMR 
analysis (Heinz et al. (1994) J. Mol . Biol, life: 869-886) . 
25 Such an analysis would not be possible without the 

structural stabilizing effects of the scaffold peptide 
domain in combination with the previously solved NMR 
analysis of the scaffold structure. 

Peptides that are not constrained by a scaffold 
30 peptide domain may not have a defined structure in 

solution. They assume a specific structure only when 
binding to their target protein, i.e., an induced fit. 
Structural analysis of the random peptide inserted in a 
scaffold peptide domain obviates the need for analysis of 
35 the target protein-random peptide complex. 
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Three-dimensional information of an effective 
binding structure provides userul information for 
designing small organic molecules that can also bind the 
target protein. For example, a small organic molecule 
5 which mimics an aspect of the 3 -dimensional structure of 
RIP can be designed and tested for its ability to inhibit 
HIV production in cells. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

<i> APPLICANT: University of Massachusetts Medical 

1 Center 

5 (ii) TITLE OF INVENTION: METHODS OF SELECTING A RANDOM 

5 (Xi) PEPTIDE THAT BINDS TO A TARGET 

PROTEIN 

(iii) NUMBER OF SEQUENCES: 7 

10 (lv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin Street, Suite 3100 

(C) CITY: Boston 

(D) STATE: MA 

15 (E) COUNTRY: USA 

(F) ZIP: 02110-2B04 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

20 <C> OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release /1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

25 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/381,800 
<B) FILING DATE: 01-FEB-1995 

(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: Clark, Paul T. 
30 (B) REGISTRATION NUMBER: 30,162 

<C) REFERENCE /DOCKET NUMBER: 04020/062WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : < 617 ) 542-S070 

(B) TELEFAX: (617)542-8906 
35 <C) TELEX: 200154 

(2) INFORMATION FOR SEQ ID NOil: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : single 

(D) TOPOLOGY x linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ACGTACCTTG TGCTTGCCAC CTGGCTTTTT GGGCTCCCCT CCGAACAGGG CCAGAGCCAG 60 
4CJCCTCTCTTG ATGTCTTCAT GACCAGAGAT GGAAGGAGCA ATGCTGGCAA AGATCTTCTC 120 
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TCCGATCTCC TCATCCTTGG AGAGGCTAGT GATCATCTTC ACATCTTCAT CGCTCCTGGC 
GAGCACACTG GCGGCCGCTC GAAGCTTTGG ACTTCTTCGC CATTGGTCAA GTCTCCAATC 
AAGGTTGTCG GCTTGTCTAC CTTGCCAGAA ATTTACGAAA AGATGGAAAA GGGTCAAATC 
G TT AG ATA CG TTGTTGACAC TTCTAAATAA 



£2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 109 amino acids 
<B) TYPE: amino acid 
(C) STRAND ED NESS : not relevant 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Thr Tyr Leu Val Leu Ala Thr Trp Val Phe Gly Leu Pro Ser Glu Gin 
15 10 15 

15 Gly Gin Ser Gin Ala Ser Leu Asp Val Phe Met Thr Arg Aap Gly Arg 
20 25 30 

Ser Ann Ala Gly Lya Aap Leu Leu Ser Aap Leu Leu lie Leu Gly Glu 
35 40 45 

Ala Ser Aap His Leu Hie lie Phe lie Gly Pro Gly Glu His Thr Glv 
20 50 55 60 

Gly Arg Ser Lya Leu Trp Thr Ser Ser Pro Leu Val Lya Ser Pro lie 
65 70 75 80 

Lya Val Val Gly Leu Ser Thr Leu Pro Glu lie Tyr Glu Lya Met Glu 
85 90 95 



25 



Lya Gly Gin lie Val Arg Tyr Val Val Aap Thr Ser Lvo 
100 105 



(2) INFORMATION TOR SEQ ID NO:3x 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 amino acida 
30 (B) TYPE: amino acid 

(C) STRAND ED NESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

35 Leu Val Lya Ser Pro lie Lye Val Val Gly Leu Ser Thr Leu Pro Glu 
1 5 10 15 

lie Tyr Glu Lya Met Glu Lya Gly Gin lie Val Arg Tyr Val Val Aap 
20 25 30 
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Thr Ser Lye 
35 



(2) INFORMATION TOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 175 base pairs 

(B) TYPE t nucleic acid 

(C) STRAND ED NESS x single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE i DNA 
10 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 
ACGTACCTTG TGCTTGCCAC CTGGGTTTTT GGGCTCCCCT CCGAACAGGG CCAGAGCCAG 60 
GCCT C T C T T G ATGTCTTCAT GACCAGAGAT GGAAGGAGCA ATGCTGGCAA AGATCTTCTC 120 
TCCGATCTGC TGATCCTTGG AGAGGCTAGT GATCATCTTC ACATCTTCAT CGGTC 175 

(2) INFORMATION FOR SEQ ID NO: 5: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 58 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : not relevant 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Thr Tyr Leu Val Leu Ala Thr Trp Val Phe Gly Leu Pro Ser Glu Gin 
15 10 15 

Gly Gin Ser Gin Ala Ser Leu Asp Val Phe Met Thr Arg Asp Gly Arg 
25 20 25 30 

Ser Asn Ala Gly Lys Asp Leu Leu Ser Asp Leu Leu lie Leu Gly Glu 
35 40 45 

Ala Ser Asp His Leu His lie Phe lie Gly 

50 55 



3p2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : b ingle 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO16: 
TTCGTCAAGT CTCCAATCAA GGTTGTCGGC TTCTCTACCT TCCCAGAAAT TTACGAAAAG SO 
ATGGAAAAGG GTCAAATCGT TAGATACGTT GTTGACACTT CTAAATAA 108 
(2) INFORMATION FOR SEQ ID NO: 7: 

5 (i) SEQUENCE CHARACTERISTICS < 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : not. relevant 

(D ) TOPOLOGY: linear 

10 (ii) HOUECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:7: 

Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cye 
15 10 15 

Asn Glu Aap Cya Gly Thr Ser 

15 20 

Other Embodiments 
Other embodiments are within the following claims. 
What is claimed is : 
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Claims 

1. A method of isolating a random peptide that 
binds a predetermined target protein, said method 
comprising: 

5 a) providing a library of nucleotide sequences 

encoding random peptides, each of said sequences encoding 
a candidate random peptide; 

b) cloning a sequence from said library into a 
first yeast two-hybrid expression vector to make a 

10 library test vector expressing a random peptide-scaf f old 
fusion protein comprising 

i) an N-terminal transcriptional activation 

domain, 

ii) said candidate random peptide, and 
is iii) a scaffold peptide domain? 

c) cloning a nucleotide sequence encoding a target 
peptide domain into a second expression vector to make a 
target vector expressing a target fusion protein 
comprising 

20 i) an N-terminal DNA binding domain, and 

ii) a C-terminal target binding domain; 

d) introducing said library test vector and said 
target vector into a yeast cell wherein said cell 
contains a marker gene expressed when the test fusion 

25 protein binds to the target fusion protein; and 

e) isolating said random peptide-scaf fold fusion 
protein from a cell in which said marker gene is 
expressed, 

2. A method of claim 1, wherein said random 

30 peptide sequence is inserted within a scaffold peptide 
domain. 



3. A method of claim 1, wherein said scaffold 
peptide domain is a peptide comprising SEQ ID NO: 3. 
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4. A method of claim l, wherein said scaffold 
peptide domain is a peptide comprising the Bl domain of G 
protein. 

5. A method of claim 1, wherein said scaffold 
5 peptide domain is a peptide comprising a zinc finger 

domain, the B domain of protein A of S . aureus, the tenth 
type III module of fibronectin, or cyclophilin. 

6. Substantially pure DNA of SEQ ID NO:l, or 
degenerate variants thereof, encoding an HIV-l Rev 

io binding peptide. 

7. A substantially pure peptide comprising an 
amino acid sequence substantially identical to SEQ ID 
NO: 2, which peptide binds to HIV-l Rev protein. 

8. Substantially pure DNA of SEQ ID NO:4 7 or 
15 degenerate variants thereof. 

9. A substantially pure peptide comprising an 
amino acid sequence substantially identical to SEQ ID 
NO: 5. 

10. An antibody that binds specifically to the 
20 peptide of claim 7. 

11. A vector containing the DNA of claim 6 
operably linked to transcriptional and translational 
regulatory sequences, wherein said vector expresses said 
DNA as an HIV-l Rev binding peptide in a mammalian cell. 
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12. A method of inhibiting HIV-1 replication in a 
mammalian cell, said method comprising: 

a) introducing a vector of claim 11 into a 
mammalian cell infected with HIV-1; and 
s b) expressing said HIV-1 Rev binding peptide in 

said cell at a concentration within said cell that 
inhibits HIV-1 replication such that virion production is 
reduced within said cell. 

13. A method of treating a human for HIV-1 
10 infection, said method comprising: 

a) providing a vector of claim 11 in a 
pharraaceutically acceptable carrier; 

b) introducing said vector into a human cell such 
that said DNA sequence is incorporated into the genome of 

is said cell and said HIV-1 Rev binding peptide is expressed 
in sufficient concentration in said cell to inhibit 
virion production; and 

c) administering said vector-containing cell into 
said human such that said cell and its progeny are viable 

20 in said human, wherein said cell and its progeny inhibit 
virion production . 

14. A method of detecting human immunodeficiency 
virus (HIV) in a sample of human CD4 + cells, said method 
comprising: 

25 a) providing a sample of human CD4 + cells; 

b) inducing replication of any HIV in said cells; 

c) lysing said cells and collecting a cell 
supernatant containing cellular and viral proteins; 

d) contacting said cell supernatant with a peptide 
30 of claim 7, and allowing formation of peptide-Rev protein 

complexes ; and 

e) detecting said peptide-Rev protein complexes as 

an indication of HIV in the cell sample. 
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15. A method of screening for candidate agents 
that inhibit production of HIV by inhibiting production 
of Rev protein in a human cell, said method comprising: 

a) providing a population of human cells 

5 containing actively replicating HIV in culture medium; 

b) contacting said population of cells with a 
candidate agent to inhibit Rev production; 

c) lysing said cells and collecting a cell 
supernatant containing cellular and viral proteins ; 

10 d ) contacting said cell supernatant with the 

peptide of claim 7 and allowing formation of peptide-Rev 
protein complexes; 

e) detecting said peptide-Rev protein complexes as 
an indication of Rev protein production in the cell 

15 sample; 

f ) repeating said steps a) , c) , d) , and e) to 
establish a control sample; and 

g) comparing said Rev protein production in the 
presence of said candidate agent with Rev protein 

20 production in said control sample to screen candidate 
agents that inhibit production of HIV. 
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